gated recurrent unit
Temporal Graph Neural Networks for Early Anomaly Detection and Performance Prediction via PV System Monitoring Data
Mukherjee, Srijani, Vuillon, Laurent, Nassif, Liliane Bou, Giroux-Julien, Stéphanie, Pabiou, Hervé, Dutykh, Denys, Tsanakas, Ionnasis
Effective performance prediction and timely anoma ly detection are paramount to ensuring the long - te rm efficiency, reliability, and economic viability of these systems. Traditional monitoring methods, often based on simple thresho lds or statistical rules, frequently fail to account for the complex interplay of environmental and operational variables that affect PV performance. These methods may lead to high rates of false positives or, more critically, miss subtle but significant a nomalies that can indicate underlying system faults. To overcome these limitations, advanced data - drive n approaches are essential. Machine learning and deep learning models have shown promise in this field, offering the ability to learn complex, non - linear relationships from vast datasets.
- Europe > France > Auvergne-Rhône-Alpes > Lyon > Lyon (0.05)
- Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
Preventing Gradient Explosions in Gated Recurrent Units
A gated recurrent unit (GRU) is a successful recurrent neural network architecture for time-series data. The GRU is typically trained using a gradient-based method, which is subject to the exploding gradient problem in which the gradient increases significantly. This problem is caused by an abrupt change in the dynamics of the GRU due to a small variation in the parameters. In this paper, we find a condition under which the dynamics of the GRU changes drastically and propose a learning method to address the exploding gradient problem. Our method constrains the dynamics of the GRU so that it does not drastically change. We evaluated our method in experiments on language modeling and polyphonic music modeling. Our experiments showed that our method can prevent the exploding gradient problem and improve modeling accuracy.
- Media > Music (0.62)
- Leisure & Entertainment (0.62)
Convolutional Spiking-based GRU Cell for Spatio-temporal Data
Abdennadher, Yesmine, Cicciarella, Eleonora, Rossi, Michele
Spike-based temporal messaging enables SNNs to efficiently process both purely temporal and spatio-temporal time-series or event-driven data. Combining SNNs with Gated Recurrent Units (GRUs), a variant of recurrent neural networks, gives rise to a robust framework for sequential data processing; however, traditional RNNs often lose local details when handling long sequences. Previous approaches, such as SpikGRU, fail to capture fine-grained local dependencies in event-based spatio-temporal data. In this paper, we introduce the Convolutional Spiking GRU (CS-GRU) cell, which leverages convolutional operations to preserve local structure and dependencies while integrating the temporal precision of spiking neurons with the efficient gating mechanisms of GRUs. This versatile architecture excels on both temporal datasets (NTIDIGITS, SHD) and spatio-temporal benchmarks (MNIST, DVSGesture, CIFAR10DVS). Our experiments show that CS-GRU outperforms state-of-the-art GRU variants by an average of 4.35%, achieving over 90% accuracy on sequential tasks and up to 99.31% on MNIST. It is worth noting that our solution achieves 69% higher efficiency compared to SpikGRU. The code is available at: https://github.com/YesmineAbdennadher/CS-GRU.
- Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.40)
- Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.40)
- North America > Cuba (0.06)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
GRAD: Real-Time Gated Recurrent Anomaly Detection in Autonomous Vehicle Sensors Using Reinforced EMA and Multi-Stage Sliding Window Techniques
Naeimi, Mohammad Hossein Jafari, Norouzi, Ali, Abdi, Athena
This paper introduces GRAD, a real-time anomaly detection method for autonomous vehicle sensors that integrates statistical analysis and deep learning to ensure the reliability of sensor data. The proposed approach combines the Reinforced Exponential Moving Average (REMA), which adapts smoothing factors and thresholding for outlier detection, with the Multi-Stage Sliding Window (MS-SW) technique for capturing both short- and long-term patterns. These features are processed using a lightweight Gated Recurrent Unit (GRU) model, which detects and classifies anomalies based on bias types, while a recovery module restores damaged sensor data to ensure continuous system operation. GRAD has a lightweight architecture consisting of two layers of GRU with a limited number of neurons that make it appropriate for real-time applications while maintaining high detection accuracy. The GRAD framework achieved remarkable performance in anomaly detection and classification. The model demonstrated an overall F1-score of 97.6% for abnormal data and 99.4% for normal data, signifying its high accuracy in distinguishing between normal and anomalous sensor data. Regarding the anomaly classification, GRAD successfully categorized different anomaly types with high precision, enabling the recovery module to accurately restore damaged sensor data. Relative to analogous studies, GRAD surpasses current models by attaining a balance between elevated detection accuracy and diminished computational expense. These results demonstrate GRAD's potential as a reliable and efficient solution for real-time anomaly detection in autonomous vehicle systems, guaranteeing safe vehicle operation with minimal computational overhead.
- Europe > Switzerland > Zürich > Zürich (0.05)
- North America > United States > Arizona > Maricopa County > Anthem (0.04)
- Asia > Middle East > Israel > Haifa District > Haifa (0.04)
- Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
- Information Technology > Security & Privacy (1.00)
- Automobiles & Trucks (1.00)
- Transportation > Ground > Road (0.93)
- Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Architecture (1.00)
Predicting Road Crossing Behaviour using Pose Detection and Sequence Modelling
Dasgupta, Subhasis, Saha, Preetam, Roy, Agniva, Sen, Jaydip
The world is rapidly advancing toward a future where artificial intelligence (AI) takes a central role in many everyday activities. In business, for example, robots have become indispensable in manufacturing processes and warehouse management. These robots efficiently handle tasks such as stacking and removing items, o ptimizing various business operations. In aviation, autopilot systems have been a standard feature in airplanes for many years, enhancing flight safety and efficiency. Similarly, in many developed countries, vehicles equipped with autopilot capabilities ar e becoming increasingly common. These self - driving vehicles are designed with an array of sensors and high - resolution cameras to monitor their surroundings, detect objects, and take necessary actions to prevent collisions or accidents. While these autonomous vehicles perform admirably on highways where the primary concern is other vehicles, they face significant challenges in busy urban environments. In such settings, it is often advisable for drivers to switch from autopilot to manual c ontrol. This is particularly crucial in bustling market areas where pedestrian behaviour can be unpredictable.
- Asia > India > West Bengal > Kolkata (0.05)
- Africa > Middle East > Tunisia > Sfax Governorate > Sfax (0.04)
- Transportation > Air (0.54)
- Transportation > Ground > Road (0.46)
Benchmarking Traditional Machine Learning and Deep Learning Models for Fault Detection in Power Transformers
Saravanan, Bhuvan, D, Pasanth Kumar M, Vengateson, Aarnesh
Accurate diagnosis of power transformer faults is essential for ensuring the stability and safety of electrical power systems. This study presents a comparative analysis of conventional machine learning (ML) algorithms and deep learning (DL) algorithms for fault classification of power transformers. Using a condition-monitored dataset spanning 10 months, various gas concentration features were normalized and used to train five ML classifiers: Support Vector Machine (SVM), k-Nearest Neighbors (KNN), Random Forest (RF), XGBoost, and Artificial Neural Network (ANN). In addition, four DL models were evaluated: Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), One-Dimensional Convolutional Neural Network (1D-CNN), and TabNet. Experimental results show that both ML and DL approaches performed comparably. The RF model achieved the highest ML accuracy at 86.82%, while the 1D-CNN model attained a close 86.30%.
Improving Deep Knowledge Tracing via Gated Architectures and Adaptive Optimization
Deep Knowledge Tracing (DKT) models student learning behavior by using Recurrent Neural Networks (RNNs) to predict future performance based on historical interaction data. However, the original implementation relied on standard RNNs in the Lua-based Torch framework, which limited extensibility and reproducibility. In this work, we revisit the DKT model from two perspectives: architectural improvements and optimization efficiency. First, we enhance the model using gated recurrent units, specifically Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRU), which better capture long-term dependencies and help mitigate vanishing gradient issues. Second, we re-implement DKT using the PyTorch framework, enabling a modular and accessible infrastructure compatible with modern deep learning workflows. We also benchmark several optimization algorithms SGD, RMSProp, Adagrad, Adam, and AdamW to evaluate their impact on convergence speed and predictive accuracy in educational modeling tasks. Experiments on the Synthetic-5 and Khan Academy datasets show that GRUs and LSTMs achieve higher accuracy and improved training stability compared to basic RNNs, while adaptive optimizers such as Adam and AdamW consistently outperform SGD in both early-stage learning and final model performance. Our open-source PyTorch implementation provides a reproducible and extensible foundation for future research in neural knowledge tracing and personalized learning systems.
- North America > Canada > Ontario > Toronto (0.15)
- North America > United States > Virginia (0.05)
- Education > Educational Technology > Educational Software > Computer Based Training (0.89)
- Education > Educational Setting (0.71)
Estimating Vehicle Speed on Roadways Using RNNs and Transformers: A Video-based Approach
Mareddy, Sai Krishna Reddy, Upplapati, Dhanush, Antharam, Dhanush Kumar
This project explores the application of advanced machine learning models, specifically Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), and Transformers, to the task of vehicle speed estimation using video data. Traditional methods of speed estimation, such as radar and manual systems, are often constrained by high costs, limited coverage, and potential disruptions. In contrast, leveraging existing surveillance infrastructure and cutting-edge neural network architectures presents a non-intrusive, scalable solution. Our approach utilizes LSTM and GRU to effectively manage long-term dependencies within the temporal sequence of video frames, while Transformers are employed to harness their self-attention mechanisms, enabling the processing of entire sequences in parallel and focusing on the most informative segments of the data. This study demonstrates that both LSTM and GRU outperform basic Recurrent Neural Networks (RNNs) due to their advanced gating mechanisms. Furthermore, increasing the sequence length of input data consistently improves model accuracy, highlighting the importance of contextual information in dynamic environments. Transformers, in particular, show exceptional adaptability and robustness across varied sequence lengths and complexities, making them highly suitable for real-time applications in diverse traffic conditions. The findings suggest that integrating these sophisticated neural network models can significantly enhance the accuracy and reliability of automated speed detection systems, thus promising to revolutionize traffic management and road safety.
- North America > United States > North Carolina (0.05)
- North America > United States > Tennessee > Davidson County > Nashville (0.04)
- Europe > Finland > Pirkanmaa > Tampere (0.04)
- Education > Educational Setting (0.48)
- Transportation > Ground > Road (0.34)
Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems
Wu, You, Sun, Mengfang, Zheng, Hongye, Hu, Jinxin, Liang, Yingbin, Lin, Zhenghao
This document presents an in-depth examination of stock market sentiment through the integration of Convolutional Neural Networks (CNN) and Gated Recurrent Units (GRU), enabling precise risk alerts. The robust feature extraction capability of CNN is utilized to preprocess and analyze extensive network text data, identifying local features and patterns. The extracted feature sequences are then input into the GRU model to understand the progression of emotional states over time and their potential impact on future market sentiment and risk. This approach addresses the order dependence and long-term dependencies inherent in time series data, resulting in a detailed analysis of stock market sentiment and effective early warnings of future risks.
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
- Asia > China > Hong Kong (0.04)
- North America > United States > Arizona (0.04)
Reviews: Preventing Gradient Explosions in Gated Recurrent Units
Summary The authors propose a method for optimizing GRU networks which aims to prevent exploding gradients. They motivate the method by showing that a constraint on the spectral norm of the state-to-state matrix keeps the dynamics of the network stable near the fixed point 0. The method is evaluated on language modelling and a music prediction task and leads to stable training in comparison to weight clipping. Technical quality The motivation of the method is well developed and it is nice that the method is evaluated on two different real-world datasets. However, one important issue I have with the evaluation is that the learning rate is not controlled for in the experiments. Unfortunately, this makes it hard to draw strong conclusions from the results.